Regression Models with Correlated Binary Response Variables: a Comparison of Diierent Methods in Finite Samples

نویسنده

  • Martin Spiess
چکیده

The present paper deals with the comparison of the performance of diierent estimation methods for regression models with correlated binary responses. Throughout, we consider probit models where an underlying latent continous random variable crosses a threshold. The error variables in the unobservable latent model are assumed to be normally distributed. The estimation procedures considered are (1) marginal maximum likelihood estimation using Gauss-Hermite quadrature, (2) generalized estimation equations (GEE) techniques with an extension to estimate tetrachoric correlations in a second step, and, (3) the MECOSA approach proposed by Schepers, Arminger and K usters (1991) using hierarchical mean and covariance structure models. We present the results of a simulation study designed to evaluate the small sample properties of the diierent estima-tors and to make some comparisons with respect to technical aspects of the estimation procedures and to bias and mean squared error of the estima-tors. The results show that the calculation of the ML estimator requires the most computing time, followed by the MECOSA estimator. For small and moderate sample sizes the calculation of the MECOSA estimator is problematic because of problems of convergence as well as a tendency of underestimating the variances. In large samples with moderate or high correlations of the errors in the latent model, the MECOSA estimators are not as eecient as ML or GEE estimators. The higher thètrue' value of an equicorrelation structure in the latent model and the larger the sample sizes are, the more is the eeciency gain of the ML estimator compared to the GEE and MECOSA estimators. Using the GEE approach, the ML estimates of tetrachoric correlations calculated in a second step are biased to a smaller extent than using the MECOSA approach.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Analysis of Bayesian Probit Regression of Binary and Polychotomous Response Data

The goal of this study is to introduce a statistical method regarding the analysis of specific latent data for regression analysis of the discrete data and to build a relation between a probit regression model (related to the discrete response) and normal linear regression model (related to the latent data of continuous response). This method provides precise inferences on binary and multinomia...

متن کامل

An Overview of the New Feature Selection Methods in Finite Mixture of Regression Models

Variable (feature) selection has attracted much attention in contemporary statistical learning and recent scientific research. This is mainly due to the rapid advancement in modern technology that allows scientists to collect data of unprecedented size and complexity. One type of statistical problem in such applications is concerned with modeling an output variable as a function of a sma...

متن کامل

Comparison of Ordinal Response Modeling Methods like Decision Trees, Ordinal Forest and L1 Penalized Continuation Ratio Regression in High Dimensional Data

Background: Response variables in most medical and health-related research have an ordinal nature. Conventional modeling methods assume predictor variables to be independent, and consider a large number of samples (n) compared to the number of covariates (p). Therefore, it is not possible to use conventional models for high dimensional genetic data in which p > n. The present study compared th...

متن کامل

Extension of Logic regression to Longitudinal data: Transition Logic Regression

Logic regression is a generalized regression and classification method that is able to make Boolean combinations as new predictive variables from the original binary variables. Logic regression was introduced for case control or cohort study with independent observations. Although in various studies, correlated observations occur due to different reasons, logic regression have not been studi...

متن کامل

به کارگیری مدل‌های رگرسیون لجستیک ترتیبی در مطالعات کیفیت زندگی

 Background & Objectives: Due to the increasing tendency to measure the quality of life in recent years and the extensive quality of life questionnaires, it is important to determine the appropriate method of analyzing data derived from these studies. The aim of the present study was to introduce ordinal logistic regression models as an appropriate method for analyzing the data of quality of li...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995